Picture for Jiatong Shi

Jiatong Shi

Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions

Add code
Feb 05, 2026
Viaarxiv icon

Optimizing Conversational Quality in Spoken Dialogue Systems with Reinforcement Learning from AI Feedback

Add code
Jan 27, 2026
Viaarxiv icon

Do Neural Codecs Generalize? A Controlled Study Across Unseen Languages and Non-Speech Tasks

Add code
Jan 18, 2026
Viaarxiv icon

IKFST: IOO and KOO Algorithms for Accelerated and Precise WFST-based End-to-End Automatic Speech Recognition

Add code
Jan 01, 2026
Viaarxiv icon

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction

Add code
Nov 08, 2025
Figure 1 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 2 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 3 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Figure 4 for BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
Viaarxiv icon

Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner

Add code
Oct 09, 2025
Viaarxiv icon

SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment

Add code
Oct 02, 2025
Figure 1 for SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Figure 2 for SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Figure 3 for SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Figure 4 for SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Viaarxiv icon

Chain-of-Thought Reasoning in Streaming Full-Duplex End-to-End Spoken Dialogue Systems

Add code
Oct 02, 2025
Viaarxiv icon

The Singing Voice Conversion Challenge 2025: From Singer Identity Conversion To Singing Style Conversion

Add code
Sep 19, 2025
Viaarxiv icon

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties

Add code
Sep 08, 2025
Figure 1 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 2 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 3 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Figure 4 for The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
Viaarxiv icon